feat: support input database without patient extensions #220

mikix · 2024-04-23T19:47:52Z

Features:

The core__patient no longer requires the presence of race/ethnicity extensions to build (it now uses the deep-schema checking code)

Bug Fixes:

If there are both detailed and ombCategory values for an extension, no longer report both - just prefer the ombCategory version.

Checklist

Consider if documentation (like in docs/) needs to be updated
Consider if tests should be added
Update template repo if there are changes to study configuration

cumulus_library/studies/core/builder_patient.py

mikix · 2024-04-23T19:51:47Z

cumulus_library/template_sql/extension_denormalize.sql.jinja

-                    PARTITION BY id, system
+                    PARTITION BY id


This is a bug fix, afaict - when you have both ombCategory and detailed, this was partitioning in such a way that both would have available_priority = 1 and both show up in the extension table. Then when you joined with patients, you'd get duplicate patient rows.

i don't think that's a bug - it's left to the study author to determine the appropriate system for their use case. the core study preserves what it finds, and the user should use distinct() on patient IDs when they're counting

We talked about this on slack - we think it is a bug, and will leave the code change in. But we have lots of questions about best approaches for race/ethnicity. Should race_detailed be a separate column? What to do with text? Problems for another day.

cumulus_library/template_sql/sql_utils.py

mikix · 2024-04-24T17:15:06Z

tests/regression/reference/core__count_medicationrequest_month.csv

Afaict, this changed due to a re-run of the regression ETL on updated sample-bulk-fhir-datasets data (we inlined some medication codes in Aug '23)

cumulus_library/studies/core/builder_patient.py

dogversioning · 2024-04-24T17:21:21Z

cumulus_library/studies/core/reference_sql/builder_documentreference.sql

@@ -153,7 +153,6 @@ CREATE TABLE core__documentreference AS
 WITH temp_documentreference AS (
    SELECT DISTINCT
        dr.id,
-        dr.type,


are there some dangling changes from a previous PR in here?

Yeah, looks like I forgot to run generate-sql at the tail end of my deep-schema-support PR, after review changes.

dogversioning · 2024-04-24T17:24:01Z

cumulus_library/template_sql/extension_denormalize.sql.jinja

-                    PARTITION BY id, system
+                    PARTITION BY id


i don't think that's a bug - it's left to the study author to determine the appropriate system for their use case. the core study preserves what it finds, and the user should use distinct() on patient IDs when they're counting

cumulus_library/template_sql/sql_utils.py

Features: - The core__patient no longer requires the presence of race/ethnicity extensions to build (it now uses the deep-schema checking code) Bug Fixes: - If there are both detailed and ombCategory values for an extension, no longer report both - just prefer the ombCategory version.

mikix force-pushed the mikix/patient-extensions-low-schema branch from 0d0b56e to 14780bc Compare April 23, 2024 19:49

mikix commented Apr 23, 2024

View reviewed changes

cumulus_library/studies/core/builder_patient.py Show resolved Hide resolved

mikix commented Apr 23, 2024

View reviewed changes

cumulus_library/template_sql/sql_utils.py Show resolved Hide resolved

mikix force-pushed the mikix/patient-extensions-low-schema branch from 14780bc to 2f65227 Compare April 23, 2024 19:54

mikix marked this pull request as ready for review April 23, 2024 19:54

mikix force-pushed the mikix/patient-extensions-low-schema branch 5 times, most recently from 8533410 to 45da810 Compare April 24, 2024 17:05

mikix commented Apr 24, 2024

View reviewed changes

dogversioning approved these changes Apr 24, 2024

View reviewed changes

mikix force-pushed the mikix/patient-extensions-low-schema branch from 45da810 to f278bba Compare April 24, 2024 19:09

mikix merged commit b6a79fe into main Apr 24, 2024
3 checks passed

mikix deleted the mikix/patient-extensions-low-schema branch April 24, 2024 19:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support input database without patient extensions #220

feat: support input database without patient extensions #220

mikix commented Apr 23, 2024 •

edited

Loading

mikix Apr 23, 2024

dogversioning Apr 24, 2024

mikix Apr 24, 2024

mikix Apr 24, 2024

dogversioning Apr 24, 2024

mikix Apr 24, 2024

dogversioning Apr 24, 2024

feat: support input database without patient extensions #220

feat: support input database without patient extensions #220

Conversation

mikix commented Apr 23, 2024 • edited Loading

Checklist

mikix Apr 23, 2024

Choose a reason for hiding this comment

dogversioning Apr 24, 2024

Choose a reason for hiding this comment

mikix Apr 24, 2024

Choose a reason for hiding this comment

mikix Apr 24, 2024

Choose a reason for hiding this comment

dogversioning Apr 24, 2024

Choose a reason for hiding this comment

mikix Apr 24, 2024

Choose a reason for hiding this comment

dogversioning Apr 24, 2024

Choose a reason for hiding this comment

mikix commented Apr 23, 2024 •

edited

Loading